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Abstract 

Ancient mitochondrial DNA has been used in a wide variety of palaeontological and archaeological studies, 
ranging from population dynamics of extinct species to patterns of domestication. Most of these studies have 
traditionally been based on the analysis of short fragments from the mitochondrial control region, analysed 
using PCR coupled with Sanger sequencing. With the introduction of high-throughput sequencing, as well 
as new enrichment technologies, the recovery of full mitochondrial genomes (mitogenomes) from ancient 
specimens has become significantly less complicated. Here we present a protocol to build ancient extracts 
into Illumina high-throughput sequencing libraries, and subsequent Agilent array-based capture to enrich 
for the desired mitogenome. Both are based on previously published protocols, with the introduction of 
several improvements aimed to increase the recovery of short DNA fragments, while keeping the cost and 
effort requirements low. This protocol was designed for enrichment of mitochondrial DNA in ancient 
or degraded samples. However, the protocols can be easily adapted for using for building libraries for 
shotgun-sequencing of whole genomes, or enrichment of other genomic regions. 


1. Introduction 

T He main characteristics of ancient DNA molecules are their low abundance in biological 
remains and the presence of molecular damage accumulated over time. Most of these 
damages are nicks caused by hydrolysis, which results in the progressive fragmentation of the 
DNA molecule into increasingly shorter fragments. The analysis of ancient samples is therefore 
parficularly challenging, and for many years fhe principal focus of ancienf DNA sfudies has been 
on mitochondrial DNA (mtDNA), as it is expected to be better preserved than nuclear DNA 
because of ifs higher copy number in fhe living cell. In such sfudies, shorf mfDNA fragmenfs are 
fypically fargefed with specifically designed primer pairs and amplified for Sanger sequencing. 
However, fhere are a number of problems associated with this approach. For example, mismatches 
between the primers and target sequence can cause the reaction to work suboptimal (or even not 
at all). This is particularly problematic in the analysis of extinct species, or other species for which 
there is no reference available for primer design. Additionally, as most ancient DNA molecules are 
below 40 bp due to hydrolytic damage, successful amplificafion is challenging af besf (ll]|. Finally, 
as modern contaminant DNA is generally longer, this approach is very sensitive to contamination, 
demanding replications of fhe experimenfs fo validafe fheir resulfs ([2). 

Wifh fhe infroducfion of Nexf Generation Sequencing (NGS), fhe analysis of ancienf DNA has 
imdergone a tremendous advance. NGS involves the preparation of genomic libraries from the 
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entire DNA extract by ligating adapters to each end. This means that species-specific primers 
are no longer necessary, effectively allowing every single fragment present in the extraction to be 
sequenced. Using such a shotgun-sequencing approach, the entire genomes from ancient modern 
humans (|3-l5). Neanderthals (|6)l, Denisovans (|7l |8) and horse ((U have been sequenced. However, 
while whole ancient genomes are certainly possible these days, we should not disregard the 
power of mitogenomes. Mitogenomes represent an interesting and unique marker, particularly to 
address population genetics, and has been used in many ancient DNA studies dTOb . For example, 
mitogenomes are still the marker of choice for samples with very poor preservation dill [12), or 
when comparing ancient samples to their known mitochondrial diversity dlSllMl . 

As described above, the amount of DNA extracted from ancient samples is usually limited, 
with a low ratio of endogenous versus environmental and contaminant DNA. In general, the 
endogenous content is less than 1% (e.g. 0 - only in very rare cases this amount exceeds 50%: 
under favourable preservation conditions (e.g. [15) or for particular sample types (e.g. [16). This 
means that when an ancient DNA library is directly sequenced, substantial sequencing is needed 
for each sample to recover sufficient coverage for the genomic region of interest - an approach 
which is expensive and labour-intensive. 

To circumvent these problems, there are a number of methods available to enrich sequencing 
libraries for the desired DNA ((also referred to as hybridization capture or just capture; for reviews 
see [El®. This approach works by creahng bait molecules with high sequence similarity to 
the target, which then hybridize to the desired DNA. Bait and target are then immobilized so 
that non-hybridized fragments can be washed away. The resulting library should then contain 
a much higher ratio of target versus contaminant DNA, so that subsequent sequencing yields 
much more usable data. Additionally, contrary to traditional PCR, hybridization capture favours 
smaller fragments ([19) ~ effectively introducing a bias against any modern contaminants, as these 
fragments are generally expected to be longer than ancient DNA. Many of the available capture 
protocols have already been applied to archaeological specimens, demonstrating the tremendous 
advantages of this approach (e.g. IT3ll20ti23t . 

Here we present a protocol to build ancient DNA extracts into Illumina NGS libraries, suitable 
for shotgun sequencing of whole genomes, or enrichment of a particular target sequence. We 
subsequently describe a protocol to perform Agilent array-based capture to enrich for the complete 
mitochondrial sequence. Both are based on previously published protocols (i24l [25| respectively), 
with the introduction of several improvements aimed to: 1. increase the recovery of short DNA 
fragments, which is characteristically the majority of ancient DNA templates; and 2. optimize 
the pooling of multiple samples, in order to reduce the cost and time of laboratory procedures in 
ancient population studies. The protocols described below are performed on extracts generated 
by the silica-based spin column extraction method by Rohland et al. {[26) - however, the library 
preparation and capture is compatible with any extraction protocol that best suits the sample. 

1. Illumina sequencing library construction 

The preparation of NGS libraries from ancient (or other degraded) DNA extracts requires protocols 
optimized for this particular purpose (i27li29]| . In general, NGS library preparation methods require 
a number of consecutive enzyme reactions, and therefore include purification steps in between. 
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either by solid phase reversible immobilization (SPRI) beads or silica-based filter columns. These 
purification steps may cause significant loss of template - particularly the shortest fragments, which 
is particularly detrimental in the case of ancient samples. A recently published library preparation 
method has proven very effective for ancienf samples, mainly because of fhe elimination of 
purificafion sfeps (l29t . However, fhe single-sfranded library preparation method is particularly 
costly and time consuming, making the double-stranded protocols generally still the method of 
choice for projecfs wifh larger number of samples. 

The library preparafion profocol described here is based on fhe original profocol for home¬ 
made Illumina libraries presenfed by Meyer and Kircher (|24l . wifh fhree main adapfafions for 
increased sensifivify. Firsf, we have reduced fhe number of purificafion sfeps from fhree down fo 
only one, by replacing fhe column-based purificafion by heaf inactivation of fhe enzymes (Fig. 1). 
This modification helps to avoid the loss of the shortest DNA fragments, particularly after the first 
enzyme reaction (blunt-end repair), at which stage DNA is not yet ligated to the adapters and the 
risk of losing fhe smallesf fragmenfs is highesf. Another adaptation in our library preparation 
is the use of a second index on fhe 5' end of fhe ancienf DNA femplafe. The imporfance of 
such double-indexing has been discussed previously (I30ll3il . Double-indexing serves as an exfra 
idenfificafion of fhe sequence, and fhus allows for pooling of more samples on a single lane. In 
addition fo fhis, since fhe indices are locafed af bofh ends of fhe femplafe, double-indexing also 
provides a means of filtering chimeric reads (or jumping-PCR) from the dataset, and thus increases 
the confidence wifh which a parficular read can be assigned fo fhe corresponding library. The 
indexing method that we utilise does not require an additional sequence read; the P5 index is 
retrieved as part of fhe regular read 1 (Fig. 1, Nofe 1). Finally, we infroduced parallel PCR's fo 
mainfain more complexify of fhe library during amplificafion. As fhe amounf of endogenous 
femplafe molecules in ancienf samples is generally low, fhe amplificafion of fhe library can 
infroduce biases (i.e. preferenfial amplificafion of cerfain fragmenfs). We reduce fhis loss of 
complexify by amplifying every sample in a number of parallel PCR reactions (usually four), each 
confaining a unique subsef of fhe original library as sfarfing femplafes. 

II. DNA hybridization capture 

The enrichmenf mefhod described in fhis profocol is a solid-sfafe capfure ufilizing high-densify 
filing DNA microarrays fo immobilize fargef regions (affer lT^l25l . Microarray capfure has been 
successfully applied fo archaeological specimens (|2T1 l22l l32ll and has some advanfages over in- 
solufion capfure. Solid-sfafe capfure can be parficularly inferesfing when fhe fargef region is 
large (i.e. >1 Gb), since in such cases buying or creating fhe baifs is rapidly becoming expensive 
and labour-infensive. In addifion, confrary fo many rn-solufion capfure protocols, array capfure 
avoids unequal representation of baifs (e.g. repetitive regions), which can lead to uneven or patchy 
coverage of the target regions ([25l . The protocol described here is based on the protocol by Hodges 
et al. (f25]) , with four adapfafions fo facilifafe efficiency and sensifivify (Fig. 2). Firsfly, we have 
eliminated fhe species-specific COT I DNA in fhe hybridization mixfure. In fhe original profocol 
COT I is added fo prevenf over-represenfafion of repefifive elemenfs, which can be infroduced 
by spurious hybridizafion of femplafe molecules fo fhe overhang of capfured molecules instead 
of baifs (l25ll . However, we have found fhaf exuding COT I does nof have an adverse effecf when 
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1-2. Blunt end repair 




Figure 1: Library preparation procedure and amplification. 1-2. The first step is the blunt-end repair, which removes 
any overhangs resulting from hydrolytic damage. The T4 DNA polymerase fills in or removes 5' and 3' 
overhangs, respectively, while T4 PNK polymerase phospohorylates the 5' ends (round edges). 3-4. The 
Illumina-specific adapters (P5 and P7) are ligated to the template molecule by the T4 DNA ligase. As the P5 
adapter includes an index sequence (position indicated in red), the library is already indexed from this step 
onwards. 5. The adapters that are not incorporated into the library, will be discarded during the purification 
using silica-based columns (MinElute PCR Purification columns). 6-7. The fill-in reaction uses a Bst 
polymerase with strand displacement activity that fills in the nicks and completes the double strand sequence 
of the adapter. 8-9. A primary PCR with primers IS7 and IS8 is used to amplify the library in a quantity 
that suits the requirements of the capture procedure. 10. The excess of primers will be discarded during 
purification using MinElute PCR purification columns. 11. There are several options to test the success of the 
primary amplification; either by electrophoresis on a 2.5% Agarose gel, or Agilent Bioanalyser/Tapestation. 
12-13. During the indexing PCR, 5'-tailed primers (IS4 and P7 index primer) are used to reconstruct the 
full sequence of the Illumina-specific adapters. The red label in the P7 indexing primer represents the position 
of the P7-index. When more than one sample is built into library, each of the described steps has to be carried 
out separately for each sample. Only after the indexing PCR, the libraries are ready to be pooled. After 
pooling the libraries, amplification can be performed using the IS5 and IS6 oligonucleotides (oligos). All oligo 
sequences can be found in Table 1. 
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capturing ancient DNA, possibly because ancient DNA fragments rarely exceed the bait length 
of 60 bp. Secondly, we exclude fhe concenfrafion of fhe eluafe using a SpeedVac, fo prevenf any 
pofential loss of femplafe during fhis sfep. Third, affer elufion of fhe enriched library from fhe 
array, we have infroduced addifional parallel PCR reachons - bofh fo accounf for fhe increased 
elufion volume due fo fhe eliminafion of fhe SpeedVac sfep, as well as fo mainfain complexify of 
fhe enriched libraries during amplificafions as discussed above. Finally, if has been found fhaf 
performing a second capfure on fhe already enriched maferial will increase fhe enrichmenf rafe 
m - fherefore we have infroduced a second capfure round in our profocol, idenfical fo fhe firsf 
enrichmenf. While fhis sfep does increase fhe fime and money required for fhe enrichmenf, fhis 
does not outweigh the increased yield of usable sequence data. We have also found fhat, provided 
that the array is stored dry and not too long, it can be re-used a second time for capfure. The 
profocol described here has been applied successfully fo capfure mitogenomes as well as nuclear 
DNA. 



Figure 2: Solid-state hybridization capture using a custom-designed Agilent microarray. The capture is performed either 
on pooled and indexed libraries, or on primary, pre-indexing libraries (A). Blocking oligos that correspond to 
the adapter sequences are added to prevent these from interfering during the hybridization capture (B). Then, 
the mixture is loaded on the array and hybridized for 60-65 h (C). The result is a library enriched for the 
target region (in this case, mitogenomes) (D). 
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11. Materials 

1. Oligonucleotides (oligos) as adapted from Meyer and Kircher, 2010 (Note 1-5): 


Table 1: Sequences for all oligonucleotides required for hybridization capture (originally described in l24l t 


Name 


Sequence 


Notes 


P5_ind_F_xxx A‘C*A*C*TCT TTC CCT ACA CGA CGC TCT TCC GAT CT xxxxWx*x 


P5_md_R_xxx 

P7_adapter_F 

P7_adapter_R 

154 

155 

156 

157 

158 

Indexing_primer_xxx 

BO P5_trun_F 
BO P5_trun_R 
BO P5_ext_F 
BO P5_ext_R 
BO P7_trun_F 
BO P7_trun_R 
BO P7_ext_F 
BO P7_ext_R 


x'xWxxxx AGA*T*C*G*G 

G’LG'A’CTG GAG TTC AGA CGT GTG CTC TTC CG*A*T‘C*T 
A‘G‘A‘LCGGAA*G‘A’G‘C 
- AAT GAT ACG GCG ACC ACC GAG ATC TAC ACT - 
CTT TCC CTA CAC GAC GCT CTT 
AAT GAT ACG GCG ACC ACC GA 
CAA GCA GAA GAC GGC ATA CGA 
ACA CTC ITT CCC TAC ACG AC 
GTG ACT GGA GTT CAG ACG TGT 
CAA GCA GAA GAC GGC ATA CGA GAT xxxxxxxx - 
- GTG ACT GGA GTT CAG ACG TGT 
ACA CTC ITT CCC TAC ACG ACG CTC TTC CGA TCT-Pho 
AGA TCG GAA GAG CGT CGT GTA GGG AAA GAG TG-Pho 
ATC TCG TAT GCC GTC TTC TGC TTG-Pho 
CAA GCA GAA GAC GGC ATA CGA GAT-Pho 
AGA TCG GAA GAG CAC ACG TCT GAA CTC CAG TCA C-Pho 
GTG ACT GGA GTT CAG ACG TGT GCT CTT CCG ATC T-Pho 
ATC TCG TAT GCC GTC TTC TGC TTG-Pho 
CAA GCA GAA GAC GGC ATA CGA GAT-Pho 


"x" indicates unique index sequence (Note 19) 

* phosphathioate oligonucleotide (PTO) bond 
"x" indicates unique index, * indicates PTO bond 

* indicates PTO bond 

* indicates PTO bond 


"x" indicates unique index sequence 

Pho indicates a 3'-end phosphate 

3'-end phosphate 

3'-end phosphate 

3'-end phosphate 

3'-end phosphate 

3'-end phosphate 

3'-end phosphate 

3'-end phosphate 


2. Reagents 

1. Oligo hybridization buffer (lOx): 500 mM NaCl, 10 mM Tris-HCl (pH 8.0), 1 mM EDTA 
(pH 8.0) 

2. Resfricfion buffer (lOx Buffer Tango, e.g. Fermenfas) 

3. T4 DNA ligase, wifh corresponding buffer and PEG-4000 (5 U/pL, e.g. Fermenfas) 

4. T4 DNA polymerase (5 U / pL e.g. Fermenfas) 

5. T4 pol 5 mucleofide kinase (10 U/pL, e.g. Fermenfas) 

6. ATP (100 mM) 

7. Bst DNA polymerase, large fragment, supplied with corresponding Thermopol Reaction 
Buffer (e.g. New England BioLabs) 

8. AmpliTaq Gold (Applied Biosysfems, supplied wifh lOx AmpliTaq buffer and 25 mM 
MgC12) 

9. dNTP's (e.g. Invifrogen) 

10. Bovine Serium Albumin (BSA, e.g. Promega) 
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11. Agilent aCGH/ Chip-on-Chip Wash Buffer 1 (Agilenf, caf no. 5188-5221) 

12. Agilenf aCGH/ Ghip-on-Ghip Wash Buffer 2 (Agilenf, caf no. 5188-5222) 

13. Oligo aGGH & GhIP-on-Ghip Hybridizafion Kif (Agilenf, caf no. 5188-5220), confains 
2X Hi-RPM Hybridizafion Buffer and lOX Oligo aCGH/ChIP-on-Chip Blocking Agenf. 
The Blocking agenf needs fo be resuspended by adding 1,350 pL nuclease-free wafer. 
Incubafe af room femperafure for 60 min and mix genfly. 

14. Qiagen MinElufe PGR purificafion columns (Qiagen, caf no. 28004), confains Qiagen 
Mrnelufe purificafion spin columns, PB binding buffer, PE washing buffer and EB 
elufion buffer, as well as Loading dye and pH indicator (nof used). Before use, add 
efhanol (96-100%) fo PE buffer (see boffle label for volume). 

3. Equipmenf 

1. Lab suifable for ancienf DNA work: i.e. physically isolafed from labs where work on 
modern DNA is performed, sterile working environment and consumables, appropriate 
protective clothing. 

2. Thermal cycler in cleanlab and modern lab (e.g. Veriti Thermal Gycler, Life Technolo¬ 
gies). 

3. Microcenfrifuge in cleanlab and modern lab (e.g. Labnef Specfrafuge 24D). 

4. Microarray hybridizafion oven, temperature range up to 95°G (section 9 and 18; e.g. 
Affymetrix Model 777 Hybridization Oven) (Note 6). 

5. Incubator oven (37°G; e.g. Labnet Mini Incubator with Mini Lab Roller). 

6. SureHyb DNA Microarray Ghamber (step 4-19) (Agilent). 

7. Microarray gasket slides (Agilent). 

8. Slide spirmer or centrifuge fiffed wifh microplafe adapfers. 

9. Slide rack. 

10. 3 slide-sfaining dishes. 

11. 1 big dish (big enough fo envelop one of fhe slide sfaining dishes wifh sufficienf room 
fo spare). 

12. Healing blocks (95°G and 37°G). 

13. Magnefic hof plate and sfirrers. 

14. 30G needles and syringes. 

4. Preparation of indexed adapter mixes 

1. Prepare fhe P5 and P7 adapter mixes as follows: 

1. P5 indexed adapter mix ('xxx' indicates index idenfifier): 

1. 1 pL of oligo P5_index_F_xxx (500 pM) 

2. 1 pL of oligo P5_index_R_xxx (500 pM) 
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3. 1.25 |a.L of Oligo hybridization buffer (lOx) 

4. 9.25 i^L of H20 

2. P7 adapfer mix 

1. 1 pL of oligo P7_adapfer_F (500 pM) 

2. 1 pL of oligo P7_adapfer_R (500 pM) 

3. 1.25 pL of Oligo hybridization buffer (lOx) 

4. 9.25 pL of H20 

2. Incubafe fhe fwo mixfures for 10 sec at 95°C, followed by a ramp from 95°C to 
12°C at a rate of 0.1°C/sec. 

3. Pool fhe two mixtures (25 pL total volume), resulting in a ready-to-use indexed 
adapter mix containing 20 pM of each adapfer. 

4. Store adapfer mixes af -20°C. 

III. Methods 


1. Array design (Note 7) 

1. Refrieve the mitogenome for the desired species (or close relative) from GenBank 
(hffp://www.ncbi.nlm.nih.gov/nucleofide/) - if more fhan one sequence is available, 
if is advisable fo include fhem all in fhe array-design fo mainfain fhe range of known 
sequence diversify of fhe species of inferesf. 

2. Use RepeafMasker (hffp://www.repeafmasker.org/; A.F.A. Smif ef ak, unpublished 
dafa) fo mask repeats. In some cases, the tRNA's will be masked as well, if desired 
fhese can be unmasked and included in fhe baifs. 

3. Creafe baits of 60 bp lengfh wifh 1 or 2 bp tiling space, and use these to design the 
Agilent SureSelect DNA capture array (either 244k or IM features). 

2. Library preparation 

1. Set up a master mix for blunf-end repair. Include per reaction (Nofe 8): 

1. 3.56 pL of H20 

2. 3.5 pL of Buffer Tango (lOx) 

3. 0.14 pL dNTP's (25 mM each) 

4. 0.25 pL of ATP (100 mM) 

5. 1.75 pL of T4 Pol 5 mucleofide kinase (10 U/pL) 

6. 0.7 pL of T4 Polymerase (5 U/pL) 

2. Add 10 pL of blunf-end repair masfer mix fo 25pL of ancienf DNA exfracf. Mix genfly 
by pipetting up and down, and incubate in a thermal cycler at 25°C for 20 min, followed 
by heaf inactivation sfep af 72°C for 20 min, fhen hold af 12°C. 

3. Sef up a masfer mix for adapfer ligafion, wifh per reacfion: 
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1. 2.5 pL of H20 

2. 5 |i.L of T4 ligase buffer (lOx) 

3. 5 gL of PEG-4000 (50%) 

4. 1.25 gL of T4 Ligase (5 U/gL) 

4. Add 1.25 gL of a unique indexed adapfer mix fo each 35 gL sample and mix carefully 
but thoroughly. Then add 13.75 gL of adapter ligation master mix to each sample (Note 
9). Mix gently and incubate in a thermal cycler at 22°C for 30 min, fhen hold af 12°C. 

5. Purify fhe reacfion using a MinElufe PCR purificafion column. Add per reaction 100 
gl Qiagen PB buffer. Centrifuge for one minute and discard flow-through. Add 750 
gl Qiagen PE washing buffer and cenfrifuge again af fhe previously given speed and 
fime. Discard flow-fhrough and repeal cenfrifugafion. Place fhe Qiagen MinElufe in a 
new tube, and add 10 gL Qiagen EB buffer. Incubate for 5 minutes and centrifuge af 
fhe same speed and fime. Add anofher lOgl of EB buffer and incubate for 5 minutes, 
and centrifuge. All cenfrifugafion sfeps are carried ouf af 17,900 x g (13,000 rpm) in a 
fablefop microcenfrifuge af room femperafure. 

6. Sef up a masfer mix for adapfer fill-in, wifh per reacfion: 

1. 14.1 gL of H20 

2. 4 gL of Thermopol buffer (lOx) 

3. 0.4 gL of dNTP's (25 mM each) 

4. 1.5 gL of Bsf Polymerase (8 U/gL) 

7. Add 20 gL of adapter fill-in masfer mix fo each 20 gL sample. Mix genfly and 
rncubafe in a fhermal cycler af 37° C for 20 min, followed by heaf inactivation step at 
80°C for 20 min, fhen hold af 12°C 

8. Por fhe primary amplification of fhe libraries, sef up a primary amplificafion master 
mix as follows, wifh per parallel reacfion: 

1. 7.8 gL of H20 

2. 2 gL of AmpliTaq Gold buffer (lOx) 

3. 1.6 gL of MgC12 (25 mM) 

4. 1.5 gL of primer IS7 (10 gM) 

5. 1.5 gL of primer IS8 (10 gM) 

6. 0.2 gL of BSA (10 mg/mL) 

7. 0.2 gL of dNTP's (25 mM each) 

8. 0.2 gL AmpliTaq Gold Polymerase (5 U/gL) (Nofe 10) 

9. Add 15 gL of master mix to each 5 gL sample (Note 11). Mix gently and amplify 
according fo the following femperafure profile: initial denaturation 94°C for 10 min, 
followed by 15 cycles of 94°C for 30 sec, 60°C for 45 sec, and 72°C for 45 sec, followed 
by a final exfension af 72°C for 5 min (Nofe 12). 
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10. Purify the pooled parallel reactions using a MinElute PCR purification column, accord¬ 
ing to the manufacturers' instructions. Elute twice with 10 pL EB buffer, each wifh 5 
min incubafion fime. 

11. Verify the success of the primary amplification on a 2.5% Agarose gel or Agilent 
Bioanalyser / Tapestation. 

12. Set up the master mix for fhe indexing PCR, wifh per parallel reacfion: 

1. 9.3 pL of H20 

2. 2 pL of AmpliTaq Gold buffer (lOx) 

3. 1.6 pL of MgC12 (25 mM) 

4. 1.5 pL of primer IS4 (10 pM) 

5. 0.2 pL of BSA (10 mg/mL) 

6. 0.2 pL of dNTP's (25 mM each) 

7. 0.2 pL AmpliTaq Gold Polymerase (5 U/pL) (Nofe 10) 

13. Add 1.5 pL of a unique index primer (10 pM) to each 5 pL sample (Note 11) and 13.5 pL 
of masfer mix. Mix genfly and amplify according fo fhe following femperafure profile: 
inifial denafurafion 94°C for 10 min, followed by 10 cycles of 94°C for 30 sec, 60°C for 
45 sec, and 72°C for 45 sec, followed by a final exfension af 72°C for 5 min (Nofe 12). 

14. Purify fhe pooled parallel reacfions using a MinElufe PGR purificafion column, as 
described in secfion 3.5. 

15. Verify the success of the primary amplification on a 2.5% Agarose gel or Agilent 
Bioanalyser / Tapestation. 

3. Capture 

1. Pool the libraries in equimolar amounts (Note 13,14) and bring the volume up to 168 
pL with H20. 

2. Prepare hybridization mix as follows (mainfain fhe order in which fhey are added): 


1. 

168 pL 

of fhe 

pooled library 

2. 

5 pL 

of 

oligo 

BO_P5_frun_F (200 pM) 

3. 

5 pL 

of 

oligo 

BO_P5_frun_R (200 pM) 

4. 

5 pL 

of 

oligo 

BO_P5_exf_F (200 pM) 

5. 

5 pL 

of 

oligo 

BO_P5_exf_R (200 pM) 

6. 

5 pL 

of 

oligo 

BO_P7_frun_F (200 pM) 

7. 

5 pL 

of 

oligo 

BO_P7_frun_R (200 pM) 

00 

5 pL 

of 

oligo 

BO_P7_exf_F (200 pM) 

9. 

5 pL 

of 

oligo 

BO_P7_exf_R (200 pM) 


10. 52 pL of Agilenf Blocking agenf (lOx) 

11. 260 pL of Agilenf Hybridization buffer (2x) 

3. Incubafe fhe mixfure af 95°C for 3 min, followed by 37°C for 30 min (Nofe 15) 
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4. Disassemble the hybridization chamber and place a clean, dry gasket in the chamber. 

5. Load 490 pL of the mixture on the gasket (Note 16) 

6. Slowly load the array down on the gasket (numbered barcode facing up, Agilent-labeled 
barcode facing down (Reminder: Agilenf = Active) 

7. Reassemble fhe chamber, screw fighf fo secure. 

8. Make sure there are no stationary bubbles by rotating the chamber. If there are stationary 
bubbles, tap the chamber on a solid surface fo dislodge fhem. 

9. Hybridize for 60-65 h af 65°C under rofafion af 12 rpm. 

10. Affer 48 h (i.e. fhe day before fhe hybridizafion is complefe), preheaf fhe materials 
necessary for fhe washing sfep wifh washing buffer 2 (see sfep 1). Fill one bottle wifh 
H20, and one bottle with Agilent washing buffer 2, and incubafe overnighf at 37°C 
together with the glassware. 

11. Disassemble the hybridization chamber and move the whole gasket-array sandwich 
into a dish with washing buffer 1. Pry fhe slides apart (Note 17) and move the array 
to the slide holder and washing dish 1. Make sure the slide is not exposed to air for 
exfended periods of fime. 

12. Wifh genfle sfirring, incubafe af room femperafure for 10 min. 

13. Prepare the washing dish 2: put the small dish in the big dish. Fill the big dish with 
water and the small dish with Agilent washing buffer 2. This way, fhe washing buffer 
will mainfain a consfanf femperafure of 37°C. 

14. Move fhe slide holder wifh fhe array info fhe dish confaining washing buffer 2 

15. Wifh genfle sfirring, incubafe af 37°C for 5 min. 

16. Spin fhe slide for 1 min af 600 rpm fo dry. 

17. Load 490 pL nuclease-free water on a new gasket, and reassemble the hybridization 
chamber. Again, make sure there are no stationary bubbles. 

18. Incubate for 10 min at 95°C. If the temperature drops after opening the oven, start timer 
when oven reaches 90°C. 

19. Use a needle and syringe to extract the eluate: first, slightly loosen the screw of fhe 
hybridizafion chamber. Then push fhe needle through the rubber seal on the non- 
labelled side of fhe slides. If the needle does not go through, loosen the screw of 
fhe chamber a bit more and try again. Elute the entire volume from fhe array before 
disassembling fhe chamber. 

20. Affer capfure amplify fhe enriched library pool in 24 parallel reactions. Set up the 
amplification master mix as follows, wifh for each reaction (Nofe 18): 

1. 5.6 pL of H20 

2. 4 pL of AmpliTaq Gold buffer (lOx) 

3. 3.2 pL of MgC12 (25 mM) 

4. 3 pL of primer IS5 (10 pM) 
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5. 3 |xL of primer IS6 (10 |xM) 

6. 0.4 pL of BSA (10 mg/mL) 

7. 0.4 pL of dNTP's (25 mM each) 

8. 0.4 pL AmpliTaq Gold Polymerase (5 U/pL) 

21. Add 20 pL of masfer mix to each 20 pL sample. Mix gently and amplify according to 
the following temperature profile: initial denaturation 94°C for 10 min, followed by 
20 cycles of 94°C for 30 sec, 60°C for 45 sec, and 72°C for 45 sec, followed by a final 
extension at 72°C for 5 min. 

22. Purify fhe pooled parallel reacfions using fhree MinElufe PCR purificafion columns, as 
described in section 3.5. Elute every column twice with lOpL EB buffer, each with 5 
min incubation time. 

23. Verify the success of the primary amplification on a 2.5% Agarose gel or Agilent 
Bioanalyser / Tapestation. 

24. Repeat the capture (starting from step 3.1) (Note 19). 

4. Notes 

1. Interesting to note: because of the P5 index in-line at the beginning of read 1, the 
libraries can also be pooled and sequenced on a single-end sequence run and still be 
de-multiplexed afterwards. This is particularly interesting for ancient DNA, where 
paired-end sequencing in order to access longer sequences is not particularly useful. 

2. The blocking oligos mentioned here would also be compatible with the indexing method 
described in (32), as it has two independent blocking oligos for the P5 adapter. 

3. The indices are not actually blocked. However, because the indices are only 8 bp long, 
any spurious hybridization will be disrupted during the heated washing step. 

4. It is important to pay particular attention to the choice of P5 indices for samples thaf 
will be sequenced on a single lane: since fhe Illumina fechnology bases cluster calling 
on the first cycles of Rl, fhe indices must have sufficient base diversity from each ofher 
to facilitate this process. When a low number of samples with limited diversity between 
the indices are sequenced on one lane, it is advisable to add PhiX library (a proprietary 
control library, available from Illumina) fo fhe sample fo increase the base diversity in 
the first cycles. 

5. When designing the index sequences, several considerations must be taken into ac¬ 
count. The indices must be as different as possible, at least by three substitutions. 
The differences serve to avoid misidentification of the index sequence, either due to 
misincorporation of the nucleotides during library amplification or to sequencing errors. 
Illumina instruments only have two lasers, red for A/C and green for G/T; therefore for 
each position, the number of A and C bases should be well balanced with the number 
G and T bases for all indices used in the same experiment. For more information, see 

m- 
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6. For step 3.18; if no oven is available that rotates and goes up to 95°C, an alternative is to 
put the chamber on one side, incubate for 5 min, rofafe fo fhe ofher side and incubafe 
for anofher 5 min. 

7. The array design described here is meanf fo enrich fhe library for whole mifogenomes: 
if other sequences are desired, one can simple follow fhe same procedure wifh fhe 
desired sequence insfead of fhe mifogenome. 

8. We recommend coimfing one more reaction fhan fhe acfual number of samples due fo 
volume loss during pipetting 

9. To avoid fhe formafion of adapfer dimers, it is important to add the adapters to the 
sample, mix carefully by pipetting up and down, and only fhen add fhe masfer mix. 
If there are many adapter dimers in the final library, less adapter mix can be used. 
However, as this is blunt-ended ligation, adding too little of fhe adapfer mix can cause 
femplafes to ligate to each other instead of the adapters, causing a loss of femplafe (33). 

10. We have observed fhaf fhe use of Accuprime Supermix I enzyme (Invifrogen) reduces 
fhe formation of adapfer- and primer-dimers compared to AmpliTaq Gold. Like 
AmpliTaq Gold, Accuprime Supermix I includes a polymerase that can read over 
uracils, so it can also be used for fhe primary amplification of ancienf DNA femplafes. 
However, it is recommended for amplicons < 200 bp, so its use should be avoided when 
inserts over 100 bp are expected. 

11. These amounts are based on four parallel reactions per sample. If a different number of 
parallel reactions are required, please adjusf fhe amounfs accordingly. 

12. If fhe library is nof visible or weak, increase cycle numbers fo 20. Alfernafively, keep 
the number of cycles to 15 and reduce the elution volume to 10-12 pL of EB buffer. This 
will help fo visualize fhe library, while keeping low fhe risk of clonalify. 

13. Be aware when calculating the equimolar amounts of samples confaining adapter 
dimers - use the total concentration, not the concentration of individual bands. While 
fheoretically fhe adapfer dimers should not hybridize to the baits and be washed 
away, in practice a certain amount of dimers will still carry over to the post-capture 
library. When there is access to a Pippin prep (Target size selection platform by Sage 
Science, Beverly MA), fhis problem can be solved. We would not recommend using gel 
extraction for size selecfion of the library, as in our experience it does not work very 
well. 

14. Anywhere between 0.5 and 20 pg final amounf of DNA should work for enrichmenf, 
buf fhe more fhe beffer. 

15. Masking fape should be used to close the lid of fhe fube during fhe 95°C incubafion; 
parafilm may nof be sufficienf fo keep fhe lid closed. 

16. Distribute the mixture equally over the surface while keeping away from fhe rubber 
seal around fhe edges. 

17. If fhe slides are difficulf fo pry aparf, fry sficking a needle fhrough fhe rubber seal fo 
equalize fhe pressure. 
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18. When performing the protocol for the first time or when the results are hard to predict, 
we advise amplifying eighf parallel PCR reacfions firsf, and fhen fhe remaining 16. By 
doing so, one sfill has femplafe when opfimizafion of fhe masfer mix, cycle numbers, 
efc. is necessary. 

19. During fhe final amplification affer fhe second capfure round, if may be wise fo sef up 
8 parallel reacfions first, purifying fhem in one MinElute column, and fhen verifying 
the success of the reaction. If the library is too bright or too weak, adjust cycle numbers. 
Then amplify fhe remaining 16 parallels and fhen purify using 2 MinElufe columns. 
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